Evaluation of a Multi-Resolution Dyadic Wavelet Transform Method for usable Speech Detection
نویسندگان
چکیده
Many applications of speech communication and speaker identification suffer from the problem of co-channel speech. This paper deals with a multi-resolution dyadic wavelet transform method for usable segments of co-channel speech detection that could be processed by a speaker identification system. Evaluation of this method is performed on TIMIT database referring to the Target to Interferer Ratio measure. Co-channel speech is constructed by mixing all possible gender speakers. Results do not show much difference for different mixtures. For the overall mixtures 95.76% of usable speech is correctly detected with false alarms of 29.65%. Keywords—Co-channel speech, usable speech, multi-resolution analysis, speaker identification
منابع مشابه
Multi-scale Edge Detection of Digital Image Based on Improved Mallat Wavelet Decomposition Algorithm
Multi-resolution analysis of digital image has attracted numerous studies since dyadic wavelet transform was introduced to this field. Aiming at achieving accurate and stationary edge detection, an improved Mallat wavelet decomposition algorithm was employed. This algorithm was defined by a low frequency component and three high frequency components. Two-dimensional signal can be reconstructed ...
متن کاملMulti-spectral Image Resolution Refinement Using Stationary Wavelet Transform with Marginal and Joint Statistics Modeling
Abstract. We present a pixel-level fusion method to refine the resolution of a multi-spectral image using a high-resolution panchromatic image. Our approach is an adaptation of the ARSIS method which takes into account the higher-order statistical moments of the wavelet coefficients. The use of the stationary wavelet transform allows the fusion between images of non-dyadic dimension with less “...
متن کاملMulti-resolution Laws’ Masks based texture classification
Wavelet transforms are widely used for texture feature extraction. For dyadic transform, frequency splitting is coarse and the orientation selection is even poorer. Laws’ mask is a traditional technique for extraction of texture feature whose main approach is towards filtering of images with five types of masks, namely level, edge, spot, ripple, and wave. With each combination of these masks, i...
متن کاملA New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
متن کاملTexture Classification of Diffused Liver Diseases Using Wavelet Transforms
Introduction: A major problem facing the patients with chronic liver diseases is the diagnostic procedure. The conventional diagnostic method depends mainly on needle biopsy which is an invasive method. There are some approaches to develop a reliable noninvasive method of evaluating histological changes in sonograms. The main characteristic used to distinguish between the normal...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1301.0278 شماره
صفحات -
تاریخ انتشار 2011